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TARGETED WEB PAGE REDIRECTION 

PRIORITY BENEFIT AND CROSS REFERENCE TO RELATED APPLICATIONS 

This Application claims priority under 35 U.S.C. §1 19(a) to Canadian Patent 
5 Application No. 2,429,761 filed May 23, 2003, which is hereby incorporated herein 
by reference in its entirety. 

TECHNICAL FIELD 

The present invention relates generally to data processing systems, and in 
particular, to redirection of web page requests on the World Wide Web. 

10 BACKGROUND INFORMATION 

The World Wide Web, or WWW, is a hypertext information and 
communication system used on the worldwide network of computers commonly 
known as the Internet. WWW operates according to a client-server model using a 
HyperText Transfer Protocol ("HTTP"). HTTP provides user access to files using a 

15 standard page description language known as HyperText Markup Language 
("HTML"). HTML is an application of Standard Generalized Markup Language 
("SGML"), which is an international standard (ISO 8879) for text information 
processing. Files that are accessed using HTML may be provided in many different 
formats including text, graphics, images, sound, and video. HTML provides basic 

20 document formatting and allows a web developer to specify links to "web servers" 
and files. These links are specified using an addressing scheme commonly known as 
the Uniform Resource Locator ("URL"). By specifying a URL, an end-user or web 
server is able to access web pages formatted according to HTML from a web server 
connected to the WWW. Entering a URL often requires the input of a long string of 

25 characters. To assist the end-user, a bookmarking facility is typically provided in 
web browsers, allowing the end-user to save a URL for a web page being viewed. 
The end-user is then able to later access the bookmarked web page by selecting the 
bookmark from the web browser. 

1 



CA920030034US 1 PATENT 

Over time, a web developer or data owner may wish to alter the organization 
or structure of web pages at a given web site. In the process, frequently, the address 
or URL of the web pages may also change. Consequently, when an end-user tries to 
access a web page for which the URL has changed, the end-user may receive an error 
5 message and be unable to connect to the desired web page. This often results in 
frustration and end-user dissatisfaction with the web site. 

To address this problem, various solutions for redirecting an outdated URL 
request to a new URL have been proposed. For example, U.S. Patent No. 6,338,082 
discloses redirecting an end-user's web browser to a domain name registration site 
10 using the Domain Name System ("DNS") if the specified URL does not exit. As 
known to those skilled in the art, the DNS is a system which translates domain names 
to Internet Protocol addresses. Similarly, U.S. Patent No. 6,014,660 also refers to 
translation within the DNS service. However, redirection through the DNS of more 
than a few web pages may be cumbersome and administratively difficult to set up. 

15 Other proposed solutions have focused on redirection on the web server itself. 

For example, it is known that a web server configuration file in web server software 
may be used to redirect a request for a web page from an old URL to a new URL. 
However, redirecting pages at the web server configuration level may require 
extensive involvement of a web server administrator, which may be inconvenient for 

20 both the web server administrator and the data owner, if they are not one and the 
same. In any event, if there are significant changes to a web page data structure, and 
many web pages must be redirected as a result, using the web server configuration 
file to redirect each web page may prove to be quite impractical. Also, a web server 
configuration file may not permit specific targeting or sufficient flexibility in 

25 mapping old URLs to new URLs as desired by the data owner. 

Thus, what is needed is a solution for redirecting web page requests which 
overcomes at least some of the drawbacks or limitations of known solutions as 
presented above. 
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SUMMARY OF THE INVENTION 

The present invention provides targeted web page redirection which may be 
readily customized by data owners. 

In an embodiment, a web page request from an end-user f s web browser is 

5 redirected to a Common Gateway Interface ("CGI") script for handling the web page 
redirection. In an embodiment, the CGI script includes a routine for matching an old 
URL to a new URL provided in a lookup table. In an embodiment, each web page of 
an old URL has a corresponding new URL such that a web browser trying to access a 
web page using an old URL is always directed to a corresponding new web page, or 

10 to a relevant page if a corresponding new page does not exist. In another 
embodiment, a customized message may be associated with a match between an old 
URL and a new URL, and the customized message may be displayed prior to 
redirection. 

The foregoing has outlined rather broadly the features and technical 
15 advantages of the present invention in order that the detailed description of the 
invention that follows may be letter understood. Additional features and advantages 
of the invention will be described hereinafter which form the subject of the claims of 
the invention. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

For a more ocmplete understanding! of the present invention, and the 
advantages tehreof, reference is now made to the following descriptions taken in 
conjunction with the accompanying drawings, in which: 

5 FIG. 1A is a schematic block diagram of a network which may provide an 

operating environment for exemplary embodiments of the invention; 

FIG. IB is a more detailed schematic block diagram of a web server in the 
network of FIG. 1A; 

FIG. 2A is a schematic diagram of web pages organized into an illustrative 
1 0 tree data structure; 

FIG. 2B is a table with illustrative URLs associated with the illustrative tree 
data structure of FIG. 2 A; 

FIG. 2C is an illustrative lookup table mapping old URLs to new URLs, and 
providing associated messages, if any; and 

15 FIGS. 3 A and 3B are schematic flowcharts of an exemplary embodiment of 

the invention. 
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DETAILED DESCRIPTION 

In the following description, numerous specific details are set forth such as 
specific network configurations, etc. to provide a thorough understanding of the 
present invention. However, it will be obvious to those skilled in the art that the 
present invention may be practiced without such specific details. In other instances, 
well-known circuits have been shown in block diagram form in order not to obscure 
the present invention in unnecessary detail. For the most part, details concerning 
timing considerations and the like have been omitted inasmuch as such details are not 
necessary to obtain a complete understanding of the present invention and are within 
the skills of persons of ordinary skill in the relevant art. 

Referring to FIGS. 1 A and IB, shown is a network 100 which may provide an 
operating environment for exemplary embodiments of the invention. A client PC 102 
may be accessed by an end-user 107 via the user interfaces provided including, for 
example, a display 108, a keyboard 110, and a mouse 112. The client PC 102 is 
connected to the Internet 120 via a suitable Internet connection 122. A web server 
130 is also connected to the Internet 120 by a suitable Internet connection 123. A 
storage unit 140 is connected to the web server 130 and contains a plurality of web 
pages 200, available to the end-user 107. One such web page 202 is shown in the 
display 108 of FIG. 1A. 

The web server 130 may include a central processing unit ("CPU") 132 and an 
associated random access memory ("RAM") 133. As shown, a process 300 may be 
stored in the storage unit 140, retrieved and temporarily stored in RAM 133, and 
executed on CPU 132. The web server 130 may include various I/O modules 
including a network I/O module 134, a user interface I/O module 135, and a storage 
I/O module 136. As shown in FIG. IB, a user interface 150 may be connected to the 
user interface I/O module 135. In an embodiment, the user interface 150 may include 
a monitor, keyboard, and mouse, which allows a web server administrator or web 
developer to access the web server 130. The web server administrator or web page 
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developer may also connect remotely via the Internet 120, or another network (not 
shown) connected to the web server 130. 

Now referring to FIG. 2A, shown is a schematic diagram of an illustrative tree 
data structure for a plurality of web pages 200A. A top level web page 202 branches 

5 into second level web pages 204A, 204B and 204C. Third level web pages 206A and 
206B branch off from second level web page 204A, and each third level web page 
206A and 206B has a plurality of web pages, 208A and 208B respectively, linked 
thereto. Other third level web pages including 206C and 206D branch off from 
second level web page 204B. Still other third level web pages 206D, 206F, and 206G 

10 branch off from second level web page 204C. Each of these other third level web 
pages 206C to 206G may have additional web pages (not shown) linked thereto. 

Referring to FIG. 2B, a table 200B shows an illustrative example of URLs 
associated with the tree data structure shown in FIG. 2A. In the illustrative example, 
the top level web page 202 is associated with a top level URL "www.xyz.edu" of a 
15 fictitious XYZ College. The second level web page 204 A is associated with the URL 
"www.xyz.edu/admissions". (Where a HTML page is not specified, an "index.html" 
page may be assumed.) As shown in FIG. 2B, various other illustrative URLs may be 
associated with the illustrative tree data structure 200 of FIG. 2 A. 

Now referring to FIG. 2C, shown is an illustrative lookup table 200C. 

20 Column 210 of the lookup table 200C is a list of "old URLs" corresponding to the 
illustrative URLs shown in table 200B of FIG. 2B. An additional line 209 is 
appended to column 210 to account for situations where a requested web page is not 
found, as will be explained further below. A second column 220 of the lookup table 
200C is a list of "new URLs" corresponding to the old URLs shown in column 210. 

25 Herein, an "old URL" is an obsolete, outdated or invalid URL to which the web site 
owner no longer desires to grant end-users access to, while a "new URL" is a valid 
URL to which it is desired for end-user traffic to be directed to. For example, old 
URL 208A which reads: 
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"www.xyz.edu/admissions/grad/pages.htmr may have been expanded to new 
web pages "www.xyz.edu/admissions/grad/prograrn/arts/pages.htmr' 

and"www.xyz.edu/admissions/grad/program/science-engineering/pages.htmr f to 
provide more specific admission information by program. When the end-user 107 

5 tries to access the old URL "www.xyz.edu/admissions/pages.htmr, either by directly 
typing in the old URL in a web browser or selecting the old URL from a bookmark 
facility, the web page request may be automatically redirected, as explained in detail 
further below. A third column 230 of lookup table 200C includes a customized 
message corresponding to the redirection between an old URL and a new URL. For 

10 example, message 208 A" provides a customized message which may be provided to 
the end-user 107 when redirecting from old URL 208 A to new URL 208 A'. 

Still referring to FIG. 2C, it is also possible that two old URLs 206F and 206G 
may be merged into one new URL 206H. In this case, more than one old URL may 
be mapped onto one new URL. 

15 While the illustrative lookup table 200C is shown as a flat table in FIG. 2C, it 

will be appreciated that any suitable data structure may be used to map the old URLs 
to new URLs, and to associate messages, if any. 

Now referring to FIGS. 3A and 3B, shown are schematic flow-charts of an 
exemplary embodiment of the invention. In an embodiment, the method 300 may 

20 incorporate a CGI script, as explained further below. As known to those skilled in the 
art, CGI is a standard for interfacing external applications with information servers, 
such as HTTP or web servers. A CGI script may be written in one of a number of 
programming languages, such as C/C++, Fortran, PERL, TCL, a Unix shell, Visual 
Basic, and AppleScript and be executed by a web server. A CGI script is executable 

25 in real-time so that it can output dynamic information. A CGI script may be 
accessible remotely from another web server. For more information on CGI scripts, 
the reader is directed to the official specification for CGI found at the following URL: 
http://hoohoo.ncsa.uiuc.edu/cgi/interface.html. 
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Method 300 starts and proceeds to block 302 at which a web page request is 
received from the end-user 107. The request may be in the form of a URL. For 
example, the end-user 107 may request access to "www.xyz.edu/programs/sciences" 
(block 206F). Method 300 receives the request and parses the URL to identify a 
suitable higher level URL, for example, the top level URL. In this example, the top 
level URL is "www.xyz.edu". Method 300 then proceeds to block 304 at which, 
based on the top level URL "www.xyz.edu", the request from the end-user 107 is 
passed to a CGI script associated with that top level URL. In an embodiment, the 
redirection of the URL request to the CGI script may be accomplished by using a 
"redirect" directive in a web server configuration file. Advantageously, this is the 
only redirection statement required in the web server configuration file. Thus, any 
URL requests having the top level URL "www.xyz.edu" may be automatically passed 
to the associated CGI script. 

Entering the URL processing block 304, method 300 proceeds to block 3 12, at 
which the CGI script reads the specific web page request for 
"www.xyz.edu/programs/sciences" (block 206F). At block 312, the method 300 
parses the URL to identify a specific web page, if possible. 

The method 300 then proceeds to block 314, at which an error check and 
secure routine may be performed. If there is a security issue, such as harmful or 
malicious content in the parsed URL, the web page request may be stopped with an 
appropriate error message displayed (not shown). If the error check routine 
determines that the requested URL cannot be found, the "old URL" may be assigned 
the "value" of "requested web page not found." Otherwise, method 300 may proceed 
to block 316 for mapping. 

At block 316, method 300 may proceed to map the requested old URL to the 
new URL based on the mapping in the lookup table 200C (FIG. 2C). For example, 
the URLs "www.xyz.edu/programs/sciences" (line 206F, column 210) and 
"www.xyz.edu/ programs/engineering" (line 206G, column 210) have both been 
mapped to new URL "www.xyz.edu/programs/science-engineering" (line 206H 1 , 
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column 220). Also, an associated message 206H" explains that "The web page you 
are seeking has been merged. You will be automatically directed to the new page in a 
moment. Please update your bookmark." 

Method 300 then proceeds to block 318, where method 300 may generate 
5 suitable HTML code corresponding to the mapping in block 316. Method 300 then 
leaves URL processing 304 and proceeds to block 306. At block 306, method 300 
may output the message 206H" and then output the new URL mapped from the end- 
user's original web page request. Method 300 then ends. 

In one embodiment, the CGI script may output a web page corresponding to 
10 the new URL directly to the end-user ! s web browser, rather than pass an instruction 
back to the web server for further processing. In this case, it will be appreciated that 
the CGI script will require end-user identifying information to be passed on to it by 
the web server, so that the output can be directed to the end-user 107. 

In another example, in the opposite case where an "old" web page is expanded 
15 into multiple web pages, the redirection should be made to the lowest level web page 
still relevant to the web page requested. In the illustrative example, the old web page 
corresponding to URL "www.xyz.edu/admissions/grad/pages.html" has been 
expanded to include further sub-index pages 

"www.xyz.edu/admissions/grad/program/arts/pages.html" and "www.xyz.edu/ 
20 admissions/grad/program/science-engineering/pages.html" to provide more specific 
admission information by program. Referring to lookup table 200C, if the end-user 
107 has requested the old URL "www.xyz.edu/admissions/grad/pages.html" (line 
208 A, column 210), the new URL will be "www.xyz.edu/admissions/grad/program/" 
(line 208A', column 220). In this case, line 208A 1 has been mapped as the "most 
25 relevant" page in the new web page data structure. When a web page data structure is 
changed in this manner, there may be many such instances in which only the data 
owner knows how the web page mapping should be made. Redirection software that 
relies on pattern matching, or some other non-specific redirection method, may not 
provide the targeted redirection desired. The data owner specified mapping provided 
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by the lookup table 200C of the present embodiment provides accurate, targeted 
redirection of web page requests, which may lead to increased end-user satisfaction. 

In a further example, if the end-user 107 has specified a URL which is not 
found, and the method 300 has assigned to the "old URL" the value of "requested web 
page not found", then at block 316, this "old URL" (line 209, column 210) may be 
mapped to the "home page" or "www.xyz.edu" (line 209', column 220), such that the 
web page request is still directed to a "relevant page". A customized error message 
209" may be associated with this mapping, and may be presented to the user at block 
306, substantially as described above. 

As will be appreciated, the lookup table 200C shown in FIG. 2C may be 
expanded significantly to map many more pages, and associate many more messages. 
Also, it will be appreciated that there is significant flexibility in terms of mapping 
web pages within a lookup table, such as the lookup table 200C in FIG. 2C. Thus, 
hundreds or even thousands of web pages may be redirected in a targeted manner. By 
specifically targeting each request for a web page from an old URL 210 to a new 
URL 220, any frustration experienced by an end-user 107 when an old URL is not 
found may be significantly reduced. Furthermore, as the data content owner will be 
the most familiar with how this mapping should occur, the ability to provide targeted 
redirection to a relevant low level web page may be significantly improved. 

Also, as the CGI script may be more readily accessible to the data content 
owner, the data content owner may provide detailed mapping from old URLs to the 
new URLs with minimal intervention from the web server administrator, and update 
the CGI script whenever required. Also, a customized message may be associated 
with each mapping to provide an enhanced end-user experience. 

In another embodiment, as the web page redirection is handled by a 
customized CGI script which generates the necessary HTML code (at block 3 1 8 of 
FIG. 3B), the lower level URLs which are part of the top level URL may be located 
in many different physical locations. For example, it is possible that a request for a 
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new URL may be directed to another web server (not shown) connected to the 
Internet 130. 

Although the present invention and its advantages have been described in 
detail, it should be understood that various changes, substitutions and alterations can 
5 be made herein without departing from the spirit and scope of the invention as 
defined by the appended claims. 
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